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Technical Field 

This invention concerns the importation of data from external systems. 
In particular it concerns the importation of data from XML files. 

5 Summary of the Invention 

In a first aspect the invention, as currently envisaged, is a method for 
importing data from XML files, comprising the steps of: 

Specifying an XML file to be imported. 

Uploading the specified XML file. 
10 Parsing the file into a series of values for graphically representing the 

structure of the data. For instance each node of an information (DOM) tree. 

If necessary the values are corrected by a user inspecting the tree, into 
a format suitable to pass to the information tree 

The tree may be viewed by a user. 
15 The tree is saved by writing the data and metadata values to storage. 

The storage may consist of four tables ie ww_form-temp (metadata). 
ww_form_item_temp (metadata). ww_files_temp (data) and 
ww_objects_temp (data). 

The invention may be used to import and then view information from 
20 external systems. In a simple implementation an XML file may be imported 
without a DTD. Alternatively, in a more complex scenario the attributes of a 
corresponding DTD may be applied along with the presentation layer 
provided by XSL. 

The information may be imported in batch or real-time mode from an 
25 external system such as Oracle Financials, SAP or Peoplesoft. 

The imported information may be integrated with other systems 
without any code changes. 

In another aspect, as currently envisaged, the invention is a computer 
system for importing data from XML files, comprising in data storage: 
30 An Upload Servlet to upload a specified XML file. 

A Parsing Servlet to decipher the file into a series of values for 
graphically representing the structure of the data. 

A Saving Servlet to save the data and metadata values of the tree to 
storage. 

35 In a further aspect, as currently envisaged, the invention is a computer 

program, comprising: 
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An Upload Servlet to upload a specified XML file. 
A Parsing Servlet to decipher the file into a series of values for 
graphically representing the structure of the data. 

A Saving Servlet to save the data and metadata values of the tree to 
5 storage. 

Brief Description of the Drawings 

An example of the invention will now be described with reference to 
the accompanying drawings, in which: 
10 Fig. 1 is a flow chart showing the importation process. 

Fig. 2 is a table showing the effect of parsing an XML file. 
Fig. 3 is a table showing the structure of temporary storage tables. 
Fig. 4 is a representation of forms that have been identified. 
Fig. 5 is a representation of documents that could be produced. 
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Detailed Description of the Invention 

Setting up of an importation interface involves installing server side 
utilities as well as a once-off client side modification. The modifications 
needed on the clients is simply a matter of installing the Java Runtime 

20 Environment 1.2.2 (JRE), which includes appropriate plug-ins for both 

Netscape Navigator 4.6+ (Navigator) and Internet Explorer 5+ (IE5). Once 
this is set up, all Java 1.2.2 applets will run in IE5 and Navigator. 

Referring now to Fig. 1, the importation process 1 is started by a user calling 
a TrafficDirector Servlet 2 and specifying the XML file to be imported. This will 

25 typically require typing in the host address, port number and database driver 
to be used. A username and password may be required to satisfy the login 
credentials for the external database. The TrafficDirector Servlet 2 then calls an 
Upload Servlet 3 and provides it with the appropriate parameters. 

Once login to the external source has been achieved, then the 

30 hostname and database name will appear, and a list of all the accessible 
tables will also be created, along with a list of all accessible columns from 
the selected table. This is the table where the data is to be retrieved from. 

To limit the values which are available for selection, the user can 
create a criteria to determine which values will be available. 

35 An XML document usually includes or contains a reference to a 

Document Type Definition (DTD). Essentially a DTD defines the grammar for 
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a class of documents, that is, it contains markup declarations that describe 
the documents logical structure and the constraints within this structure. An 
example of a DTD and a valid XML document written to this DTD is as 
follows. This example will be referred to throughout the remainder of this 
5 document: 

Document Type Definition 

< ! ELEMENT orderlist (order*) > 

<! ELEMENT order (datetime,notes,salesperson, customer, part*)> 

< IATTLIST order id ID #REQUIRED> 
10 < ! ELEMENT datetime (#PCDATA) > 

< ! ELEMENT notes (#PCDATA)> 

< IELEMENT salesperson (name,department,phone)> 

< IATTLIST salesperson id ID #REQUIRED> 

< IELEMENT customer (name,address, phone) > 
15 < IATTLIST customer id ID #IMPLIED> 

< IELEMENT part (name,quantity,price)> 

< IATTLIST part id ID #REQUIRED> 

< IELEMENT name (#PCDATA)> 

< IELEMENT department (#PCDATA) > 
20 < IELEMENT phone (#PCDATA) > 

< IELEMENT address (#PCDATA)> 

< IELEMENT quantity (#PCDATA)> 

< IELEMENT price (#PCDATA)> 



25 Sample XML DOCUMENT 

<?xml version="1.0" encoding="iso-8859-l"?> 
<!DOCTYPE orderlist SYSTEM "orderlistdtd"> 
<orderlist> 
30 < order id = "_5449431"> 

<datetime>Feb 1 2000 5:37PM</datetime> 
< notes > We need to hurry this order through... </notes> 
< salesperson id = M 37"> 
< name > Jill Smith < /name > 



< department > Sales < /department > 
<phone>90991234</phone> 

</salesperson> 

< customer id = "909921"> 
< name > Bobs Plumbing</name > 
< address >1 George St, Sydney, 2000</address> 

< phone > 909956 78 < /phone > 
</customer> 

<part id = "10987"> 

< name > Widget Flange </name > 

< quantity > 100 </quantity> 

< price >0. 50 </price> 
</part> 

<part id = "10990"> 

< name > Widget Head Bolt < /name > 

< quantity > 100 < /quantity > 

< price > 2 .00 </price > 
</part> 

< /order > 

<order id = "_5449432"> 
<datetime>Feb 1 2000 5: 37PM < /date time > 
< notes > Take your time, this customer still hasn't paid last 
invoice. </notes> 

<salesperson id = "41"> 

< name > John Sparky < /name > 

< department> Sales </department> 

< phone > 90991235 < /phone > 
</salesperson> 

< customer id = "909989" > 

< name > Ke ns Hardware < /name > 

< address > 99 Ken St Sydney, 2000</address> 

< phone > 90999 10 K/p hone > 
</customer> 

<part id = "10969"> 

< name > Widget Rubber Seal</name> 
<quantity>200</quantity> 
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< price > 0.25 </price > 
</part> 

<part id="10899"> 

< name > Widget Spring < /name > 
5 < quantity > 10 </quantity> 

< price > 4.00 </price> 
</part> 

</order> 
</orderlist> 
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The Upload Servlet 3 uploads the specified XML file and calls a Parser Servlet 4 
which reads the file and deciphers it to produce a Document Object Model (as 
defined by W3C). The Document Object Model (DOM) provides programmatic 
access to the structure and content of the data being imported. In practice this 
15 means converting it into a series of values representing each node of an information 
(DOM) tree; as shown in Fig. 2. 

The values are then passed to an XMLToData Converter Servlet 5 
which ensures the values retrieved from the Parser 4 are in the correct format 
to pass to the information tree. The tree may then be viewed by the user 
20 using a Display Tree Servlet 6. 

If the tree is to be saved it is written to temporary storage 7. The 
temporary storage areas basically consist of four tables ie ww_form-temp 
(metadata). ww_form_item_temp (metadata). ww_files_temp (data) and 
ww_objects_temp (data); the table structure is shown in Fig. 3. 
25 Upon saving the XML tree the metadata and data values need to be stored. 

The relationship between parent-child and individual fields on a form is quite 
simple. All tags that appear at the same tree level are fields on the same form. If a 
tag is identified then it has a parent node. 

Once an XML document has been received from an external source it 
30 can be fed into a data driven application comprised of: 

o Metadata - The forms (templates) required to publish content 
o A Home - The folders defined to hold the published content 
o Search Facilities - Automatic access to search facilities specifically 
tailored for the structure of the content published. 
35 o Content - The published content. 

o Workflow - A workflow process to direct published content. 



This task involves the following steps: 
1. Create new metadata (Form templates) by analysing the DOM's structure. 

Given that XML data is hierarchical in structure, the metadata produced 
will also be hierarchical, that is, the forms will be built on parent/child 
relationships. Identifying the forms required involves a traversal of the DOM 
tree using the following criteria: 

Start with the root node. 

Any node with only a single value becomes a new field on the 
current form. 

Any node with more than one child (the value of a node is 
represented as a child) requires a new form, a child form. 
This process is recursive as we walk through the DOM structure: 



begin 

node = getRootNode 
createForm(node) 

end 



sub createForm(node) 
begin 

for each child of this node 

if child node has more than one child of it's own 

newForm = createForm(child) 

thisForm.addChild(newForm) 
else 

newField = createField(child) 
thisForm.addField(newField) 
endif 
end for 
end 



Given this process and the sample XML document presented, the forms 
shown in Figure. 4 can been identified: 



2. Create a home for it and associated workflow. 
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The home is essentially a folder structure in which each folder has a 
defined purpose. A home for the sample imported looks as follows: 

& Widget Orders 

!& All- A folder to contain all content published 

& Search- A means of accessing the automatic search facility 

for this content. 
& Publish- This folder contains the form required to publish 
new content. 



In order to publish content a workflow process also needs to be 
defined. At its simplest, the workflow for content imported from an 
external XML source is 'direct to repository'. That is, given generic XML 
we are unable to identify an individual or individuals for the workflow 

15 process. 

3. Populate it with content extracted from the DOM using the metadata 
defined in step 1. 

Populating means building a set of documents from the XML content 
20 imported based on the forms defined in step 1. 

Unlike the process of creating the metadata (the forms), which was 
driven by the structure of the DOM, this process is driven by the structure of 
the new forms. 

Again, this process is recursive as we walk through the form structure: 

25 

begin 

node = getRootNode 
form = getParentForm 
createDocumentfnode, form) 

30 end 

sub createDocument(node, form) 
begin 

for each field in this form 
35 get all children of current node that have same name as 

form field for each child node 
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newDocField = createDocField(childnode, formfield) 
thisDocument.addField(newDocField) 
end for 
end for 

for each child form of the current form 

get all children of current node that have same name as 
child form for each child node 

newDocument = createDocumentfchildnode, 
childform) 

thisDocument.addChild(newDocument) 
endfor 
endfor 

end 

15 Given this process and our sample XML document, the documents shown 

in Figure. 5 would be produced. 

Having created the building blocks it remains to map the objects created 
to the underlying relational database. 

20 It will be appreciated by persons skilled in the art that numerous 

variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 

Dated this tenth day of March 2000 

i - Launch Pty Ltd 
Patent Attorneys for the Applicant: 

F B RICE & CO 



5 



10 



25 



T^^c D.'r^cfw 
2_ 

7 



3 



■ T 



-4/ 




4 



7 



J 



F*. / 



Parameters: in 


T51>e<3f Field 


XMLFileName 


Clob (stream of characters] 






Parameters Out 


Type Of Field 


XMLNodeld 


Integer (long) 


XMLParentNodeld 


Integer (long) 


XMLNodeName 


String (2000) 


XMLUrlToPerform 


String (2000) 



Fig. 2 



Table Name - ww form temp 


i Cblumn Name 






form temp id 


Number (9) 


User id 


Varchar2(50) 


Form name 


Varchar2(255) 


Description 


Varchar2(5l2) 


Title 


Varchar2(2000] 


Title XML tag 


Varchar2(255) 


Parent form temp id 


Number (9) 




Table Name - ww form item temp 


Column Name 


Type 


form temp item id 


Number(9) 


Form temp id 


Number(9) 


Item text 


Varchar2(255) 


Item type 


Varchar2(l] 


Item XML tag 


Varchar2(255] 




Table Name - ww files temp > 


Column Name 


Type- ' . . / .• ■ • . •• =;•;*,'■: 


form temp id 


Number (9) 


Folder id 


Number(9] 


Form temp id 


Number(9) 


title 


Varchar2(2000) 




Table Name - : -w 


w objects temp 


iGoliimri Name :: ■ "-"X'JJ- : ^ y r'1;^ ■ ; 




Object temp id 


Number(9) 


File temp id 


Number(9) 


form temp id 


Number(9) 


value 


Varchar2(2O0O) 



Fig. 3 
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